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The high variability of renewable energy resources presents significant challenges to the operation of the 
electric power grid. Conventional generators can be used to mitigate this variability but are costly to operate 
and produce carbon emissions. Energy storage provides a more environmentally friendly alternative, but 
is costly to deploy in large amounts. This paper studies the limits on the benefits of energy storage to 
renewable energy: How effective is storage at mitigating the adverse effects of renewable energy variability? 
How much storage is needed? What are the optimal control policies for operating storage? To provide answers 
to these questions, we first formulate the power flow in a single-bus power system with storage as an infinite 
horizon stochastic program. We find the optimal policies for arbitrary net renewable generation process when 
the cost function is the average conventional generation (environmental cost) and when it is the average 
loss of load probability (reliability cost). We obtain more refined results by considering the multi-timescale 
operation of the power system. We view the power flow in each timescale as the superposition of a predicted 
(deterministic) component and an prediction error (residual) component and formulate the residual power 
flow problem as an infinite horizon dynamic program. Assuming that the net generation prediction error is 
an IID process, we quantify the asymptotic benefits of storage. With the additional assumption of Laplace 
distributed prediction error, we obtain closed form expressions for the stationary distribution of storage and 
conventional generation. Finally, we propose a two-threshold policy that trades off conventional generation 
saving with loss of load probability. We illustrate our results and corroborate the IfD and Laplace assumptions 
numerically using datasets from CAISO and NREL. 
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1. Introduction 

The rapid increase in the world demand for electricity (EIA 2011, Figure 72) coupled with the 
need to reduce the high carbon emissions due to electric power generation from fossil fuel (EPA 
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2011, Table 3-7) are driving a dramatic increase in renewable energy generation from sources such 
as wind, solar, and hydro. The power generated from wind and solar, however, is intermittent and 
uncertain, which presents significant challenges to power system operation as the penetration of 
these sources increases (NREL 2010). In the long timescale (weeks to hours), this variability causes 
power imbalances: When renewable generation falls short of meeting the demand, more conventional 
generation from combined-cycle combustion and gas turbines is needed, which increases power 
system operation cost and offsets some of the environmental benefits of renewable energy (Hart 
and Jacobson 2011); when renewable generation exceeds demand, the excess power generated must 
be curtailed. In the short timescale (minutes to seconds), the variability of renewable generation 
can lead to large frequency and voltage variations and higher loss of load probability. 

In addition to using conventional generation, renewable energy variability can be mitigated 
architecturally via geographical generation diversity (NREL 2010) and renewable resource diversity 
(Li et al. 2009), and operationally using demand-response (Kirby and Milligan 2010) and energy 
storage (Bitar et al. 2011, Denholm et al. 2010). In particular, energy storage can help in two quite 
different ways (EPRI 2010). 

• In the long timescale, bulk energy storage systems, such as pumped hydroelectric storage and 
compressed air energy storage (CAES), can be charged by the excess renewable energy generation 
during off-peak hours and discharged during peak hours, hence reducing the need for additional 
conventional generation capacity, and renewable energy curtailment. 

• In the short timescale, fast-response energy storage systems, such as fiywheels and batteries, 
can also help improve reliability beyond what fast-ramping generation can achieve because of their 
much faster response time. 

This paper aims to establish the limits on the benefits of storage for mitigating renewable energy 
variability. How much can storage help reduce the need for conventional generation? How much 
can it help improve reliability? How much storage is needed to reap these benefits? What are the 
optimal control policies that achieve these limits? Satisfactory answers to these questions can help 
in architecting the smart grid as well as in operating it efficiently and reliably. Since we wish to 
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establish limits on the benefits of storage rather than analyzing the operation of a particular power 
system with storage, we will ignore the fixed and operating costs of storage as well as explicit 
economic benefits such as arbitrage (e.g. see Eyer and Corey (2010)) throughout this paper. Under 
certain assumptions, we will find that most of the benefits can be achieved with only a modest 
amount of storage. 

Following is an outline of the rest of the paper: 

• In Section 2, we consider a single-bus power system with storage and a slotted-time model for 
load and energy generation. We formulate the power flow problem as an infinite horizon average- 
cost stochastic program in which the input is the net renewable generation (difference between the 
renewable power generated and the demand) and the controls are conventional generation and the 
storage charging and discharging operations. We consider two cost functions, the expected average 
conventional generation (environmental cost) and the expected average loss of load probability 
(reliability cost). We find the optimal policies for each of these cost functions for arbitrary net 
renewable generation process (Theorems 1 and 2). The performance of these policies is illustrated 
using load data from CAISO (2012) and simulated renewable wind power generation data from 
NREL (Potter et al. 2008). 

• In Section 3, we make progress toward quantifying the degree to which storage can help 
mitigate the impacts of renewable generation and the amount of storage needed. We consider the 
multi-timescale operation of the power system (day-ahead, hour-ahead, minutes-ahead, and real 
time). For each timescale, we view power flow in the single-bus power system with storage as the 
superposition of a predicted (deterministic) component and an error (residual) random component. 
We assume that the predicted component is balanced (with possibly a fixed offset) and formulate 
the residual power flow problem as an infinite horizon average-cost dynamic program with the 
net renewable generation prediction error as input and fast-ramping generation and storage as 
control variables. Assuming that the net renewable generation error is an IID process, we show that 
storage can reduce fast-ramping generation (relative to no storage) by a factor that approaches the 
storage round-trip inefficiency as its capacity becomes large (Proposition 1). We further show that 
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storage can reduce the average loss of load probability to zero as storage capacity becomes large 
(Proposition 4). 

We then observe from the NREL simulated wind power dataset that in the short timescale 
(minutes), the wind power generation prediction error is close to Laplace distributed. Under this 
additional assumption, we obtain closed form expressions for the minimum average fast-ramping 
generation and the stationary distribution of the stored power sequence in some special cases 
(Propositions 2 and 3). We show that most of the possible reduction in conventional generation 
can be achieved with relatively small storage capacity. We also show that the average loss of load 
probability can be reduced by an order of magnitude with small storage capacity. 

• The optimal policies we establish in Section 2 represent two extremes: The policy that min- 
imizes the average conventional generation always uses the stored energy ahead of conventional 
generation and never uses conventional generation to charge storage, while the policy that mini- 
mizes the average loss of load probability uses conventional generation ahead of stored energy and 
to keep storage as full as possible. In Section 4, we present a two-threshold policy that includes 
these two policies as special cases. We also show that this policy minimizes the one-period weighted 
sum of the aforementioned cost functions and is optimal for all the numerical examples we tried. 
Using this policy, we find a tradeoff between conventional generation capacity and storage capacity 
needed to achieve prescribed conventional generation consumption and loss of load probability. 

• In Section 5 we generalize the policies established in Section 2 to the case where the storage 
charging and discharging rates are constrained (Theorems 3 and 4) . We find that most of the benefit 
to average conventional generation can be attained with relatively small charging and discharging 
rates, while higher such rates are needed for the average loss of load probability. 

This paper is a significantly reorganized and expanded version of the conference paper (Su and 
Gamal 2011). There is a large body of previous work on energy storage. An overview of energy 
storage technologies and applications can be found in EPRI (2010), Roberts and Sandberg (2011) 
and references therein. Prior related work to this paper include Chandy et al. (2010), Gayme and 
Topcu (2011), Oh (2011). Both Chandy et al. (2010) and Gayme and Topcu (2011) assume that 
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the net load is deterministic. Chandy et al. (2010) formulate a dynamic dc optimal power flow 
problem with energy storage as a convex program. The optimal policy is established explicitly 
for some special cases. Gayme and Topcu (2011) formulate a dynamic ac optimal power flow 
problem with energy storage. The problem is shown to be non-convex in general, and sufficient 
conditions for strong duality are established. Oh (2011) models renewable generation as a sequence 
of discrete random variables. An approximate stochastic programming method is proposed and 
illustrated via numerical examples. Concurrent and independent related work to this paper include 
Koutsopoulos et al. (2011) and ParandehGheibi et al. (2011) in which continuous-time models for 
the net renewable generation are considered with different problem formulations. 
2. Single-bus power system with storage 

Consider the single-bus electric power system with storage depicted in Figure 1, which consists 
of conventional generation, net renewable generation (diff'erence between the renewable generation 
and the load) , and energy storage. This power system may represent a transmission network with 
high renewable penetration, a distribution network with distributed renewable generators and 
energy storage devices, a microgrid not operated in the island mode where the power from the 
macrogrid acts as a fast-ramping generator (Lasseter et al. 2002) , a wind farm with energy storage 
devices in which generation is acquired through an electricity market, or a stand-alone hybrid 
renewable energy system with battery storage (Bernal-Agustm and Dufo-Lopez 2009, Deshmukh 
and Deshmukh 2008). The conventional generation may include base-load generators (coal-flred, 
hydro, and nuclear power plants), intermediate generators (combined-cycle combustion turbine), 
and peaking and fast-ramping generators (gas turbines). The renewable generation may include 
wind and solar. The numerical results in this paper assume only wind power. The energy storage 
may include bulk energy storage (compressed air and pumped hydroelectric storage) and fast- 
response energy storage (flywheels and batteries). 

We assume a slotted-time model for the dynamics of the power system, where time is divided 
into slots each of length r hours and power is constant over each time slot. In the following, we 
introduce the needed definitions and the assumptions used throughout the paper. 
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Figure 1 Single-bus power system with storage. 
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• The power suppHed by the net renewable generation (difference between the renewable gen- 
eration power and the load) in time slot i = 1, 2, . . . is denoted by Aj. The sequence Ai, A2, ... is 
in general a random process. 

• We denote the total power capacity of conventional generation by G^ax MW. The power 
supplied in time slot i = l,2,... is denoted by Gi < Gmax- 

We characterized energy storage by the following parameters: 

• The energy storage capacity rS^ax MW-h is the maximum amount of energy that can be 
stored, where Smax is referred to as the power storage capacity. Real-world energy storage devices 
cannot be completely discharged, and there is a limit on their minimum energy level. We use this 
minimum level as a reference and assume without loss of generality that it is equal to zero. 

• The stored power at the beginning of time slot i = 1, 2, . . . is denoted by Si MW. 

• The rated storage power conversion C^ax MW is the maximum input (charging) power. The 
charging power at time i = 1, 2, . . . is denoted by Ci < Cmax- 

• The rated storage output power -Dmax MW is the maximum output (discharging) power. The 
discharging power at time i = 1, 2, . . . is denoted by Di < Dmax- 

• The charging efficiency G (0, 1) is the ratio of the charged power to the input power. The 
discharging efficiency G (0, 1) is the ratio of the output power to the discharged power. The 
round-trip efficiency therefore is a = Octtd- 

• The storage efficiency is the fraction of retained power over a time slot. We assume throughout 
that the storage efficiency is very high compared to the round-trip efficiency and assume that it is 
equal to one. 
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Using the above definitions, we can express the dynamics of the stored energy as 

Si+i = Si + a^Ci- —Di fori = 1,2,... 

with the constraints 0< Si< /Smax, < Cj < Cmax, and < Dj < Dmax, where Si G [0, ^^ax] is given. 

In Sections 2 through 4, we will assume unconstrained rated output power and power conversion 
of the energy storage, i.e., 

^c^max -^max '^max* (1) 

ad 

The general case is discussed in Section 5. 

We assume that the (negative) net renewable generation power is to be balanced as much as 
possible by conventional generation and stored power: 

• If Aj > — Gmax — niin{adS'j,Dmax}, then there is sufficient power capacity and the balance 
constraint must be satisfied, i.e., Gj + L'j — Cj + Aj > 0. Note that if Gj + L'j — Cj + Aj > 0, then 
there is excess generation. We assume that this excess generation is curtailed. 

• If Aj < — Gmax — niinjad'S'i, -Dmax}, then loss of load occurs. In this case, conventional generation 
is at its power capacity and storage is discharged at the rated storage output power, i.e., G, = Gmax, 
Cj = 0, and A = min{Q;d'S'j, I>max}- 

In the following two subsections, we formulate the power flow problem in the single-bus sys- 
tem as infinite horizon stochastic programs with two different cost functions. We establish the 
optimal control policies for both cost functions for arbitrary net renewable generation process. In 
Subsection 2.3, we illustrate these policies using datasets from CAISO and NREL. 

2.1. Minimizing average conventional generation 

The first cost function we consider is the expected long term average conventional generation. This 
is motivated by the need to reduce the carbon emissions of conventional generation. We seek to 
minimize this cost function by controlling the amount of conventional generation Gj, charging Cj, 
and discharging Di used in each time slot i = 1,2, . . ., where the triple {Gi,Ci, Di) is a function of 



8 



Su and El Gamal: Limits on the Benefits of Energy Storage for Renewable Integration 



the history Hi = (S*!, Ai, 5*2, A2, . . . , Si-i,Ai_i,Si). A control policy tt is a sequence of these triples, 
i.e., TT = {{Gi, Ci, Di) : i = 1, 2, . . .}. A policy is said to be stationary if TTj = tTj for all i and j. We 
are now ready to define the first optimization problem. 

Stochastic program I: Minimizing average conventional generation. 



minimize J7''^^^(7r,5'i) = limsupE 



1 " 
n ^-^ 



subject to iSj+i = Si + a^Ci — ^-Dj, 

< iSj < iSmax) < Ci, 

0<G'i<G^ax, 0<A, 

0<Gi-Ci + Di + max{Aj, -G^ax - o^dSi}, 

where the expectation is over the net renewable generation sequence A^, i = 1,2, We denote 

the optimal policy by 7r^^\ 

It turns out that a simple stationary policy is optimal for arbitrary net renewable generation 
process (including deterministic sequences). 

Theorem 1. The optimal policy tt^^^ for stochastic program I is given in Table 1. 

The optimal policy is illustrated by the "phase-diagram" in Figure 2. When the prediction error 
5 >0, the optimal policy charges the storage using the excess renewable generation as much as 
possible. When the prediction error S <0, the storage is first discharged to compensate for as much 
of the renewable power deficit as possible. Conventional generation is then used to compensate for 
the remaining renewable generation deficit (if any). Thus, the optimal policy tt^^^ never charges 
the storage using conventional generation. This greedy policy is a consequence of the linearity of 
the cost function and the imperfect round-trip storage efficiency as will become clear in the proof. 

To prove the theorem, consider the finite horizon counterpart of stochastic program I. Define 
the cost-to-go function 

i=k 
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Table 1 Optimal policy in Theorem 1 and corresponding stored power dynamics. 
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Figure 2 Illustration of the optimal policy TT^i) in Theorem 1. The value in each region corresponds to the stored 
power at the end of slot i when the stored power and prediction error at the beginning of this slot are 
Si and Ai, respectively. 




Let TT* be the policy achieving inf^ jl^\Tr, Hk). For a fixed pair (Hk-i, Ak^i), we will use the shorter 
notation J^^^(7r*, 5^) in place of jI^\'k* , Hi-_i, A^-i, Sk)- In the following lemma, we establish key 
properties of the cost-to-go function. The rest of the proof of Theorem 1 is given in Appendix A. 

Lemma 1 . The minimum cost-to-go function must satisfy the conditions: 

1. jl^\Tr*,Sk + c) < jl^\Tr*,Sk) for < c < S^ax — Sk, i.e., excess energy is not curtailed until 
storage is fully charged.. 

2. j'^\'K*,Sk + c) > j'^\'K*,Sk) — CKdC for < c < Smax — Sk, i-c., the cost of using conventional 
generation to charge storage by c MW is cja^, but the reduction in cost is at most a^c. 

To prove part 1, given the optimal policy tt*, we find another policy tt such that J^^ij^., 5'fe+c) = 
jl^\TT*,Sk)- Let Hi and H* be the history sequences under the policy tt and tt*, respectively. For 
fixed {Hk-i,Ak-i), consider Hk = {Hk-i,Ak,Sk + c) and = {Hk-i, Ak, Sk). Let 
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Gi = G*, Di = D* C, = min C;,— 5^ax-.Si + — A* 



1 



1 



(2) 



The policy tt is illustrated in Figure 3(a). By induction, the stored power under policy tt is always 
higher than that under policy tt*. Thus, by definition, j\^\'k* ,Sk) = Jk\T^, Sk + c)> J'^\tt* ,Sk + c). 

To prove part 2 of the lemma, given the optimal policy tt*, we find a policy tt such that 
4'^(7r, Sk) < Ji^\7r*,Sk + c) + Q!dC. For fixed (iJ^.i, Afc_i), consider = {Hk-uAk, Sk) and H; = 
{Hk-i,Ak,Sk + c). Let 

a = C*, Di = mm{D*,aA{Si + a,C:)}, = min{G* + L>* - A,G'^ax}. (3) 

The policy tt is illustrated in Figure 3(b). By induction, the stored power under poHcy tt is always 
lower than the stored power under policy tt*. The cost-to-go function of policy tt can be upper 
bounded as 



ji"{7r,S,)<E 



i=k 



^ (tt* , + c) + E [ (5* - 5fe) - {Sl^, - 5„+i) | H,] 



<j'^\'iT\Sk + c) + aaC. 



Figure 3 Illustration for the proof of Lemma 1. 
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2.2. Minimizing average loss of load probability 

As we mentioned, energy storage can be used to improve power system reliability in the presence 
of renewable energy variation. As a measure of system reliability, we use the average loss of load 
probability. Define the loss of load cost at time i as 

1 if Ai < -G 

l{A,<-G.3.-min{aaS.i..a.}} = |o OtherwisC. ^' 

We wish to find the control policy tt = {(Gj,Ci,Dj) : i = 1,2,...} that minimizes the expected 
average loss of load cost. 

Stochastic program II: Minimizing average loss of load. 



minimize J^'^^ (tt. Si) = lim sup E 



1 " 

n 5^ 



^ ^ -{Ai<-Gmax-ad'Sj} 
i=l 



subject to Si+i = Si + a^Ci 



0<Gi<G^a., 0<A 



< Gj - Ci + A + max{Aj, -G^ax - ad-Si}, 

where the expectation is over the net renewable generation sequence Aj, i = 1,2, We denote 

the optimal policy by tt^^^. 

The optimal policy for this program is simply to keep the stored power as high as possible by 
using both excess renewable and conventional generation. The policy again holds for an arbitrary 
net renewable generation process. 

Theorem 2. The optimal policy tt^^^ for stochastic program II is given in Table 2. 

Since there is no conventional generation cost, in some cases the optimal policy charges the 
storage with conventional generation to minimize the loss of load probability as illustrated in the 
phase-diagram in Figure 4,. 
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Table 2 Optimal policy in Theorem 2 and corresponding stored power. 
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Figure 4 Illustration of the optimal policy in Theorem 2. 
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To prove the theorem, we again consider its finite horizon counterpart and define the cost-to-go 
function as 



< — Gmax — oC(^Si} 



i—k 



Let TT* be the pohcy achieving inf^ J^^^(7r, ilfe). For a fixed pair Afc_i), we will used the 

shorter notation Jfc^^(vr, 5*^) in place of J^^^(7r, iJfe_i, Afe_i, 5fe). The key step in the proof is to show 



jf V% -^fc + c) < ^'"iT^^Sk) for < c < S^,^ - Sk. 



7-(2) 



(5) 



This follows since the policy vr defined in (2) always has higher stored power than the optimal policy 
TT*. Since the loss of load probability cannot increase if the amount of stored power is increased, 
we have J^^^ (tt* ,Sk)> j)p (tt, Sk + c)> jj^^ (tt* ,Sk + c). The rest of the proof of Theorem 2 is given 
in Appendix A. 
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2.3. Numerical results 

To investigate the reduction in conventional generation and loss of load probability attained by 
using energy storage, we use the simulated Western Wind Dataset from NREL (Potter et al. 2008) 
and the actual load dataset from CAISO (2012). The NREL dataset is based on numerical weather 
prediction (NWP) models. It attempts to recreate the potential wind power generation of more 
than 30 000 sites with ten 3 MW turbines at each location in the western U.S from 2004 to 2006, 
and the wind power data is sampled every 10 minutes. The CAISO load dataset includes the hourly 
load in California in 2004. Figure 5 plots the total simulated hourly power output of the wind 
turbines and the hourly load in California. 

Figure 5 The hourly average of the aggregate wind power of 3045 sites in Cahfornia from NREL dataset and the 
actual load from CAISO dataset for three months in 2004. The average wind power is 25624 MW, and 
average load is 27232 MW. 
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Figure 6(a) shows the average conventional generation under the optimal policy tt'^^ for several 
values of power storage capacities and round-trip storage efficiencies a = 60% and 80%. As can be 
seen from the figure, conventional generation consumption can be reduced using storage by 44% 
and 53% for a = 60% and 80%, respectively, when the storage capacity is 60 times the total rated 
power of the wind turbines, which is equivalent to 914 GW-h of storage. Note that 80% of this 
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reduction can be achieved by power storage capacity equal to 15 times the total rated power of the 
wind turbines, i.e., 228 GW-h. 

Figure 6(b) plots the average loss of load probability under the optimal policy tt^^^ for several 
values of power storage capacities and round-trip storage efficiencies a = 60% and 80%. For large 
'S'max, the reduction in the loss of load probability is roughly exponential in Smax- We also see that 
the average loss of load probability can be reduced by an order of magnitude with power storage 
capacity less than 2 standard deviations of the net renewable generation process. 

Figure 6 Figures (a) and (b) show the minimum average conventional generation and minimum loss of load 
probabihty, respectively, for NREL and CAISO data versus power storage capacity with round-trip 
efficiencies a = 60% and 80% and Gmax = 40000 MW. 



(a) (b) 
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3. Residual power system 

In the previous section, we formulated the single-bus power system stochastic programs I and II and 
found the optimal policies tt^^^ and tt^^^ that minimize expected average conventional generation 
and average loss of load probability, respectively, for arbitrary net renewable generation process. 
These policies provide answers to the question concerning optimal storage control policies. Although 
answers to the other questions we seek to answer can be obtained numerically as illustrated in 
Subsection 2.3, the results lump together vastly different types of conventional generation and 
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storage resources that are deployed at very different timescales. Electric power systems are typically 
operated in multiple timescales as illustrated in Figure 7: 

• Day-ahead: Each day, an hourly prediction of the net load (the negative of the net renewable 
generation) for the next day is made. Base generation and bulk storage are scheduled to meet this 
prediction. 

• Hour-ahead: Each hour, a refined prediction of the net load in the next hour is made. Peaking 
generation and medium-response storage are scheduled to meet the difference (prediction error) 
between the day-ahead and hour-ahead prediction. 

• Minutes-ahead: Every few minutes, a prediction of the net load in the next few minutes is 
made. Fast-ramping generation and fast-response storage are scheduled to balance the difference 
between the minutes-ahead and hour-ahead prediction. 

• Real-time: The scheduled generation and storage are operated. The deviation of actual net 
load from the minutes-ahead prediction is matched by additional fast-ramping generation and 
fast-response storage. 

Figure 7 Multi-timescale power grid operation. 
MW 
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10- minute- ahead 
Hom-aiiead 
Day-ahead 



' 1 \ \ — '^H \ \ 1 ►time 

K Operating hour h 

Consider the multi-timescale power system operation. The day-ahead power flow prediction can 
be modeled in the same manner as the single-bus power system discussed in the previous section. 
In each subsequent timescale, the system can be decomposed into a scheduled (deterministic) part 
and a residual (random) part as depicted in Figure 8. We assume that the scheduled generation 
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and storage power balance the predicted net renewable generation power; hence we can model the 
residual power system in exactly the same manner as the original system studied in Section 2, 
except that the input is now the net renewable generation prediction error and the controls are fast- 
ramping generation and the charging and discharging of the fast-response energy storage. Unlike 
net renewable generation, which is in general a messy stochastic process, under good prediction, the 
prediction error process can be modeled as an IID zero-mean process Aj, i = 1, 2, . . ., with variance 
Var[Aj] = (see Subsection 3.4 for numerical justification of this assumption). In Subsection 3.3, 
we consider the case of over provisioning in which more power is scheduled than the predicted net 
renewable generation, that is, the mean of the prediction error E[Aj] > 0. 



Figure 8 Decomposition of the total single-bus power system into a scheduled (predicted) power component and 
a residual power system component. 
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3.1. Minimizing average fast-ramping generation 

We first consider the role of fast-response energy storage in reducing the required fast-ramping 
generation. The formulation of this problem is the same as stochastic program I, but with the 
additional assumption that Aj, i = 1, 2, . . ., is an IID process with zero-mean and variance Var[Aj] = 
(7^. We refer to this new problem as dynamic program I. Since the optimal policy tt^^' in Theorem 1 
for stochastic program I holds for any net renewable generation process, it is optimal for dynamic 
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program I. The IID assumption, however, allows us to provide some answers to the question of 
how much storage can help. 

Consider the extreme case in which the fast-ramping generation capacity Gmax is unlimited. If 
there is no storage, i.e., S'max = 0, then it is not difficult to see that the average cost is E[A~], where 
x~ = max{— a:;,0}. The minimum expected average fast-ramping generation for unlimited storage 
is given in the following. 



Proposition 1. For unlimited Gmax cind Smax; the minimum average cost is J'^^\Tr^^\ Si) = (1 — 
a)E[A-]. 



The proof of this proposition is given in Appendix B. 

Comparing the average costs for no storage to unlimited power storage capacity, this proposition 
shows that storage can reduce the amount of needed fast-ramping generation (relative to no storage) 
in the limit by a factor equal to the round-trip storage inefficiency. This is not surprising because 
the IID zero-mean prediction error assumption implies that over the long term, the excess energy 
is roughly equal to the deficit. With infinite capacity and a = 1, storage can compensate for almost 
all the variation in renewable generation. However, when a < 1, it can compensate for at most this 
fraction of the variation and the rest needs to be compensated for by fast-ramping generation. 

In exploring the wind generation prediction and prediction error data obtained using the NREL 
dataset (see Subsection 3.4 for details), we found that the first-order distribution of the 10-minute 
ahead prediction error Aj can be well-approximated by a Laplace(A) random variable with proba- 
bility density function (pdf) /a((^) = e^^l''l/2A. With this additional assumptions we can obtain the 
closed form expressions for the average fast-ramping generation and the stationary distributions 
of the stored power and fast-ramping generation in the following propositions. 

Proposition 2. The minimum expected average fast-ramping generation under the Laplace 
assumption is 
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The proof of this proposition is given in Appendix B. 

Using this result, we can obtain an answer to the question of how much fast-response storage is 
needed. Consider the derivative of the optimal cost function at storage capacity S^ax, 



dS„ 



ad(l - e-^^-"=''')(l - Q;)2e-(l/ac-ad)ASmax/2 



4(1 — Q;e~(l/"'^~"d)>'Smax/2)2 

Since this derivative decreases close to exponentially, a small storage capacity is sufficient to achieve 
most of the reduction in the fast-ramping generation. For example, for a typical round-trip storage 
efficiency of 60%-80% (see Schainker (2004)), 80% of the possible reduction in the cost function 
can be achieved with power storage capacity less than 4 standard deviations of the prediction error. 

Note that under the optimal policy in Theorem 1, the stored power sequence is a homogeneous 
Markov process. In the following, we find the stationary distribution for this Markov process 
under the Laplace assumption. Furthermore, using this stationary distribution, we can find the 
distribution of the fast-ramping generation. 

Proposition 3. The cdf of the stationary distribution of the stored power under the optimal policy 

in Theorem 1 and the Laplace assumption is 

1- 0-5(1 + a)e-(i/°<=-"d)As/2 

~ 1 _ Q;e-(V«c-ad)ASmax/2 

for < s < Smax; Fs{s) = for s < 0, and Fs{s) = 1 for s > 5max- The corresponding distribution 
of the fast-ramping generation is 

FMq) = 1 ^— ^ e-^s 

^G\y) J- 2(l-Q;e-(V«c-ad)AS„,ax/2)'' 

forO<g< Gmax, Fc{g) = for g<0, and Fc{g) = 1 for g> G^ax- 

The proof of this proposition is given in Appendix B. 

Using the stationary distribution of the stored power sequence in Proposition 3, we can readily 
find the following expression for the expected average loss of load probability 



\ Qg — (l/cfc — ad)A'S'max/2 
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With no storage, the expected loss of load probability is e 



-AG, 



max 



/2. As Smax OO, 



^ (1 -a)e 



-AG, 



max 



/2. Thus, under policy 7r^^\ storage in the limit can reduce the 



expected average loss of load probability also by the round-trip storage inefficiency. 
3.2. Minimizing the loss of load probability 

We now consider the role of fast-response energy storage in reducing the average loss of load. The 
formulation of this problem is the same as stochastic program II with the additional assumption 
that Aj, i = 1, 2, . . ., is an IID process with zero-mean and variance Var[Aj] = cr^. We refer to this 
new problem as dynamic program II. Since the optimal policy vr*^^' in Theorem 2 for stochastic 
program II holds for any net renewable generation process, it is optimal for dynamic program II. 
In the following, we show that the benefit of storage to the loss of load probability is unbounded. 

Proposition 4. For unlimited storage capacity S^.^, if and G^ax satisfy the conditions 



for some constant c>0, then J'^^^(7r^^\S'i) = 0. 

The proof of this proposition is given in Appendix B. Note that (6) requires /a with fast diminishing 
tail, and (7) requires large enough Gmax- 

Unlike minimization of the expected average fast-ramping generation, we are not able to find 
closed form expressions for the optimal cost function or the stationary distributions under the 
Laplace assumption. However, it can be verified that for a > e~^'^™''=', Laplace(A) satisfies the 
sufficient conditions in Proposition 4, thus the expected average loss of load probability tends to 
zero as Smax — >■ oo. Furthermore, in the following we show that the convergence rate of the loss of 
load probability is exponential in Smax- 

Proposition 5. For a > e""*"^™, the exponent of the minimum expected average loss of load 
under the Laplace assumption decreases linearly as S'max increases, i.e., 



limsupxF^(—x) < c. 



(6) 



E [a{G^^ + A)+ - {G^^ + A)-] > 0, 



(7) 




<7. 



max 



max 
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for some constants — oo < 7min < 7max < 0. 

The proof is given in Appendix B. 

Note that when there is no fast-ramping generation, i.e., Gmax = 0, the sufficient condition in 
Proposition 4 given by (7) does not hold. However, the optimal policy reduces to a special case of 
the optimal policy in Theorem 1. Thus, storage can only reduce the expected loss of load probability 
by a factor no smaller than the round-trip storage inefficiency. 

3.3. Over-provisioned net generation prediction error 

Suppose that the net renewable generation prediction error A, has mean E[Aj] = /x > and variance 
Var[Aj] = (7^. In the following, we find the minimum expected average fast-ramping generation for 
unlimited G^ax and S'^ax- 

Proposition 6. For unlimited Gmax and S^^x, the minimum average cost is J7'^^^(7r^^\ Si) = 
(E[A--aA+])+. 

With no storage, i.e., Smax = 0, the minimum average cost is J'^^\7r''^\ Si) = E[A"]. Thus, storage 
can reduce the expected average fast-ramping generation J^^^ by a factor of 1 — aE[A+]/E[A~] 
for small jj, and reduce J'^^^ to for large /j,. 

For minimizing the expected loss of load probability. Proposition 4 still holds. Note that the 
minimum requirement on Gmax in (7) decreases linearly in /x. 

3.4. Numerical results 

First we provide numerical justifications for the IID and Laplace assumptions introduced in pre- 
vious subsections. We use the NREL dataset for the 50 highest power density offshore wind sites 
in California. We assume that the variations in demand are much smaller than in the wind power, 
which is justified by the high penetration scenario assumed this paper (also see Figure 5). Hence, 
we assume the net renewable generation error in our model is equal to the wind power prediction 
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error. Since the dataset we use does not include forecast data, we use a simple linear predictor. Fig- 
ure 9(a) plots the total power output of the 50 sites over a two-week period. Figures 9(b) and 9(c) 
plot the 10-minute-ahead prediction and the prediction error sequences for the same two-week 
period, respectively. 



Figure 9 Figure (a) shows the wind power over a two-week period. The average wind power is 560.26 MW. 

Figure (b) shows the 10-rninutc-ahead prediction given by the hncar predictor based on the 6 samples in 
the past hour and optimized for the one-year data in 2004. The prediction error is shown in Figure (c). 
The mean absolute value of the prediction error is 13.99 MW, and the standard deviation of the 
prediction error is 20.88 MW. 

(a) Wind power 
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(c) Prediction error 
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To test the IID assumption, we generated a sequence of IID random variables distributed accord- 
ing to the empirical marginal distribution of the prediction error sequence from the NREL dataset. 
Figure 10 compares the expected average fast-ramping generation costs for the NREL dataset and 
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the IID sequence using policy tt^^' . The maximum absolute difference between costs for the NREL 
dataset and for the IID sequence normalized by the NREL dataset cost is less than 3% and 6% for 
a = 60% and a = 80%, respectively. 

To test the Laplace assumption, in Figure 11 we compare the empirical pdf of the prediction 
error dataset to the best fit Laplace distribution. The maximum absolute difference between the 
empirical cdf of the prediction error and the Laplace cdf is 0.018. Hence, the assumption of IID 
Laplace distributed net renewable generation prediction error appears to be reasonable. In the 
following, we will further corroborate this assumption with the NREL data using the average cost 
results. 



Figure 10 The optimal expected average fast-ramping generation for the wind data versus power storage capacity 
for round-trip efficiencies a = 60% and 80% and Gmax = 160 MW. 
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Now we illustrate the results in Subsections 3.1 and 3.2. Figure 12(a) compares the minimum 
average costs in Proposition 2 and using the three-year simulated wind data for various values 
of power storage capacities and round-trip storage efficiencies a = 60% and 80%. The maximum 
absolute difference between the theoretical and the simulated costs normalized by the theoretical 
cost is less than 6% and 8% for a = 60% and a = 80%, respectively. Thus, the Laplace distribution 
appears to be an acceptable approximation of the simulated wind generation data from NREL. 
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Figure 11 The empirical pdf of the 10-minute-ahead wind power prediction error for three years versus the best 
fit Laplace(A) pdf with 1/A = 13.99. 
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Note that 80% of the reduction for unlimited power storage capacity can be achieved with power 
storage capacity less than 4 standard deviations of the prediction error, which is equivalent to 13.9 
MW-h. 

Figure 13(a) compares the empirical pdf of the stored power of the simulated wind genera- 
tion data to the stationary pdf under the Laplace distribution assumption in Proposition 3. The 
corresponding empirical pdf of the fast-ramping generation and its stationary pdf are shown in 
Figure 13(b). Note again the simulation results corroborate well with the theory. 

The loss of load probability depends on the tail of the cdf of the net generation prediction 
error. However, the number of samples of the wind data is small, and thus it is difficult to compare 
the loss of load probabilities of the wind dataset and the Laplace-distributed prediction error 
sequence. To illustrate the loss of load probabilities for the residual power system, we will assume 
that the net generation prediction error sequence is distributed according to the corresponding 
best fit Laplace distribution. Then the expected loss of load probability can be expressed as 

1 " 

(tt, 5i) = lim sup - FA(-G„ax - a^Si), 

1=1 

where is the cdf of the best fit Laplace distribution. Figure 12(b) compares the expected loss of 
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load probabilities and using the three-year simulated wind data for various values of power storage 
capacities and round-trip storage efficiencies a = 60% and 80%. For large iSmax, the average loss of 
load probability decreases exponentially in 5max as expected in Proposition 5. 

Figure 12 Figures (a) and (b) show the optimal expected average fast-ramping generation and the optimal loss of 
load probability, respectively, for the wind data versus power storage capacity for round-trip efficiencies 
a = 60% and 80% and Gmax = 160 MW. 



(a) (b) 




Power storage capacity Smax (MW) Power storage capacity Smax (MW) 



Figure 13 Figures (a) and (b) show the pdfs of the stored power and the fast-ramping generation, respectively, 
for a = 60%, Smax = 100 MW, and G^ax = 160 MW. 
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(b) Fast-ramping generation 
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4. Two-threshold policy 

In Section 2, we showed that the pohcy tt^^^ minimizes the expected average conventional (or fast- 
ramping) generation — storage is discharged before conventional generation is used and conventional 
generation is never used to charge storage. We also showed that the policy tt^^^ minimizes the 
average loss of load probability — conventional generation is used before storage is discharged and 
conventional generation is used to keep storage as full as possible. In this section, we propose a 
general two-threshold policy, which includes tt^^^ and tt'^^ as extreme special cases. This policy tries 
to keep the stored power above a charging threshold and tries not to discharge storage below a 
discharging threshold. 

Let < S'c < 5d < Sjnax- The two-threshold policy is characterized by a charging threshold Sc and 
a discharging threshold as given in Table 3 and illustrated in Figure 14 for Sc < QJcGmax- When 
the prediction error A > 0, the storage is charged as much as possible using excess net renewable 
generation. If the stored power after this charging is above ^c, then conventional generation is 
not used. However, if it is below Sc, then the storage is charged as close to 5c as possible using 
conventional generation. When the prediction error A < — Gmax, the storage must be discharged 
to balance the prediction error. If there is still unbalanced prediction error after the storage is 
discharged to S^, then fast-ramping generation is used such that the stored power is as close to S^ 
as possible. When the prediction error — Gmax < A < 0, the case where the stored power is either 
lower than 5c or higher than 5d is similar to the above cases. When the stored power is between 
5c and 5<j, only the fast-ramping generation is used to balance the prediction error, and the stored 
power is unchanged. 

Note that the optimal policies tt^^^ and tt^^^ for the two extreme cases in Theorem 1 and 2 are 
special cases of this policy with thresholds (0,0) and (5max, 5max), respectively. 
Next we consider the general weighted-sum average cost: 

J(7r, 5i) = (tt, 5i) + p2 J*'^ (vr, 5i), 
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Table 3 Two-threshold policy parameterized by {Sc,Sd)- 
(a) Q<Si<Sc 
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Figure 14 Illustration of the two-threshold policy. 
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where pi > and p2 > 0. In the following proposition, we show that the two-threshold policy is 
optimal for the residual power system dynamic program with the above general weighted-sum cost 
and two slots. 

Proposition 7. // the pdf of the prediction error /a increases on (— oo,0], then there exist Sd 
and such that < < < Smax? o-nd the optimal policy for the general residual power 
system dynamic program with two slots is the two-threshold policy with parameters {Sci,Sdi) for 
time i = l,2. 

The proof of this proposition is given in Appendix C. 

To demonstrate the two-threshold policy, we implemented a dynamic programming method by 
discretizing the state space and then running the value iteration (Bertsekas 2007) . For the values of 
Gmax, Sjnax, Po, ^nd pi used in the following numerical examples, we find that the policy obtained 
from the value iteration is a discretized two-threshold policy. 

Tradeoff between j'^' and J^^^: Fi gure 15 shows the tradeoff between the fast-ramping generation 
and the loss of load probability for no storage and for storage capacities S^ax = 50 MW and 
^max = 100 MW with fast-ramping generation capacity Gmax = 160 MW. Note that the results for 
the Laplace pdf corroborate very well with the simulated wind generation data. As shown in the 
figure the loss of load probability is improved by more than two orders of magnitude by using 
power storage capacity less than 5 standard deviations of the prediction error. 
Tradeoff between S^nax and G^naK- In Figure 16, we compare the two ways of mitigating renew- 
able energy variability; using fast-ramping generation and using fast-response storage. We fix the 
expected average fast-ramping generation at 3.6 MW (corresponding to 80% maximum reduction 
in the fast-ramping generation) and the loss of load probability at 2 ■ 10~^ (corresponds to one loss 
of load event every 10 years). To achieve these goals with minimum power storage capacity, we 
need Gmax = 170 MW and S'max = 60 MW. To reduce the fossil fuel generation and to achieve the 
same goals, we can replace 1 MW of Gmax with 1.3 MW of S'max- 
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Figure 15 The tradeoff between the fast-ramping generation and the loss of load probability for Gmax = 160 MW 
under two-threshold policies. 



7 

S 6 
5 \- 

a 

d 4 - 

a 



bJO 



a 2 - 



-CS 1 





le-08 



— 1 1 1 1 — I — I I I I 



—1 1 1 1 — I — I I I I 



— 1 1 1 1 — I — I I I 



Wind datasct Smax = ^ 

Wind datasct Smax — 50 — x- 
Wind dataset Smax = 100 - - - 



—I I I I I ' ' ' ' 



_i I I I I ' ' ' ' 



—I I I I I ' ' ' 



le-07 le-06 
Loss of load probability J^^^ 



le-05 



Figure 16 The tradeoff between the fast-ramping generation power capacity and the power storage capacity for 
J-(i) < 3.6 MW and J^^^ < 2 • 10"^ 
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5. Constrained Cmax and Dmax 

In previous sections, we assumed the rated storage power conversion Cmax and the rated storage 
output power Dmax to be unconstrained. In this section, we relax this condition and show that 
Theorems 1 and 2 can be extended to find the optimal policies with constraints on Cmax and D^^. 
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Consider stochastic program I with the additional constraints 



max _ 




max 5 



-C^max £ O^^Sj 



max* 



(8) 



In this following, we show that the optimal policy is a slight generalization of the policy for the 
unconstrained case in Theorem 1. 

Theorem 3. The optimal policy vr*^^-' for stochastic program I with constrained Cmax and i^max is 
equal to 



Cf ' = min{Ci^\ C„,J, ^ = min{L>«, i^max}, ^ = min{[Cf ^ - d'^'^ - A,]+, G^^.}- 



The proof of this theorem is given in Appendix A. Note that the above optimal policy reduces to 
the policy in Theorem 1 when OcCmax = ^max/ad = -^max- 

Similarly, we can consider stochastic program II with constraints in (8) and obtain a generaliza- 
tion of the policy for the unconstrained case in Theorem 2. 

Theorem 4. The optimal policy tt^*' for stochastic program II with constrained Cmax and Dmax is 
equal to 



' = min{Cf \ Cmax}, A-'^ = min{L>f \ Dmax}, G<f^ = min{[Cf ^ - D^'^ - A,]+, Gma.}- 



The proof of this theorem is given in Appendix A. 

Figure 17(a) plots the minimum average conventional generation versus QicCmax/S'max = 
-Dmax/od'S'max for thc NREL wlud dataset and the CAISO load dataset. Note that most of the 
reduction in conventional generation in the unconstrained case can be achieved by Cmax and -Dmax 
less than 5% of S'max- 

Figure 17(b) plots the minimum expected average loss of load probability versus OcCmax/'S'max = 
-Dmax/cKd S'max- Note that loss of load may occur even when the stored power is high due to con- 
strained -Dmax- Thus, the loss of load probability can be much higher for small Cmax and .Dmax than 
for the unconstrained case. 
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Figure 17 Figures (a) and (b) show the minimum expected average conventional generation and the minimum 
expected loss of load probability, respectively, versus power storage capacity for round-trip efficiencies 
a = 60% and 80% and Cmax = Gmax = 160 MW. 
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6. Conclusion 

This paper aimed to answer questions concerning the hmits on the benefit of energy storage to 
renewable energy: How much can storage help? how much storage is needed? and what are the 
optimal control strategies for storage? To answer these questions, we formulated a single-bus power 
system with storage as stochastic program and established the optimal policies when the cost 
function is the expected average conventional generation and when it is the expected average loss 
of load probability. We proposed a general two-threshold policy for which these optimal policies 
are extreme special cases. We obtained refined analytical results by considering the multi-timescale 
operation of the grid. The results lead to the following potentially useful conclusions: 

• Using energy storage reduces fast-ramping generation by a factor up to the reciprocal of the 
round-trip inefficiency. 

• 80% of the reduction in fast-ramping generation can be achieved using storage capacity equal 
to only four standard deviations of the net renewable generation prediction error. 

• The loss of load probability can be reduced by an unbounded factor as the energy storage 
capacity increases. 
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• The loss of load probability can be reduced by an order of magnitude using storage capacity 
equal to only two standard deviations of the prediction error. 

• The Laplace distribution, which makes the analysis far more tractable, appears to be a rea- 
sonable approximation of the short time-scale prediction error of wind energy generation. 

• Most reduction in conventional generation for unconstrained charging and discharging rates 
can be achieved by relatively small such rates. 

We corroborated our assumptions and analytical results with simulated wind data. 

There are many open questions suggested by this work: What is the minimum expected average 
fast-ramping generation for constrained Cmax and Z^max under the Laplace assumption? What is 
the minimum expected average loss of load probability for limited G^ax and S'max? What is the 
natural cost function for which the two-threshold policy is optimal? What is the optimal policy 
when the ramping constraints of the fast-ramping generation are considered? 

Appendix A: Proofs for the single-bus power system results 

Proof of Theorems 1 and 3. Since Theorem 1 is a special case of Theorem 3, we only need to 
prove the general case. Note that the proof of Lemma 1 holds for constrained Cmax and Dmax since 
the policies in (2) and (3) have smaller charging and discharging power than the optimal policies. 
Thus, we only need to show that for any policy 7rfe_i, 

Jk-i = Gk-i + jj. ^ ( Tr*,Sk-i + acCk-i Dk-i] 

, t(1) (^* a , ^(1) 1 \ _ t(1) 

> + 'Jk , bk-1 + ac(-^k-i ^k-1 — -Jk-i- 

\ "d y 

Now we consider the following seven cases. 

1- (Smax - Sk-i)/ac < Afe_i and S^^ - a^C^^^ < Sk-i- By Lemma 1, 

Jk-l ^ Jk ^l"" } Sma,x) = ^k-l- 

2. Cmax < Afe_i and S^-i < S'max - "cCmax- By Lemma 1, 

Jk-l ^ Jk ^(tT*, S'fc-i + OcCmax) = «^fe_V 
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3. < Afe_i < min{(S'max - Sk-i)/a^, C^ax}- 

• If aCk-i — Dk_i < q;(7^^\ — -D^^j, then by Lemma 1, 

Jk-i>Ji'^ (^7r*,S,_, + a^Ci'},-l-Di'l,^=ji'2,. 

• If aCk-i — Dk-i > aCl.]!-^^ — -Dfcl^i, then by Lemma 1, 

- ftd ^ac(Cfe_i - Afc_i) - ^Dfe_i^ > 0. 

4. — minladiSfc-i, -Dmax} < ^k-\ < 0. The proof is the same as the previous case. 

5. -Gmax - a^Sk-i < Afc_i < -a^Sk-i and Sk-i < D^^^/a^. By Lemma 1, 

Jk-i — Jk-1 — Ck-1 — Dk-i + a^Sk-i 

- ttd ( Sk-i + a^Ck-i - —Dk-i] > 0. 

6. — Gmax — -f^rnax < ^k-1 < " -C^max -Dmax/ Q^d < iS/s-i . 

«^/c-l ~ Jk-1 — ^k-1 ~ Dk-l + -Dmax 

-Qd ( "cCfe-l - — (-Dfc_i -i:)rnax) ] > 0. 

7. Afc_i < — Gmax — minjctd'S'fc-i, -Dmax}- By the definition of loss of load, Jk-i = Jik-i- 

By induction we show that the stationary policy vr'^^ is optimal for the finite horizon stochastic 
program. Then for any n G N and policy tt, we have 

and thus 

J(^) (tt^^) , 5i) = lim sup - ' (tt^^^ , 5i) < lim sup - ^ (tt, -Si) = J^^^ (tt, St). 

n—>oo n— >oo ^ 

Proof of Theorems 2 and 4- Since Theorem 2 is a special case of Theorem 4, we only need to 
prove the general case. Note that the property of the cost-to-go function in (5) holds for constrained 
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Cmax and Djnax smcB the policy in (2) have smaller charging and discharging power than the optimal 
policy. Thus, we need to show that for any policy iTk-i, 

Jk-i = l{Afe_i<-Gmax-adSfc_i} + Jk^ i'n*,Sk-i+acCk-i -Dfc-i ) 

Since the first terms in Jk-i and J^!.\ are equal, by (5), we only need to show that 

Sk = Sk-i + (XcCk-i Dk-i 

> ^fc_l + Oc^-^fe-l J^k-l—'^k ■ 

Now we consider the following seven cases. 

1. If (>Sniax — Sk-l)/ Oic — Gmax ^ ^k-1 ^nd S'max — QJcC'max ^ Sk-l, then Sk < iSinax = 

2. If Cmax — Gmax < and Sfc-i < /Smax " ClcC'max, then Sk < Sk-l + CtcC'max = S^ . 

3. If -Gmax < Afc_i < min{(5„iax - Sk-i)/a^, C^^^} - G^^^, then 

Sk < Sk-l + ac(Gfc_i + -Dfc-i + Afc_i) Dk-i 

< Sk-l + ac(Gmax + Afe_i) = 5f \ 

4. If — min{adS'^;_i, -D„iax} ~ G'max ^ A;;_]^ < — Gmaxi then 

Sk ^ Sk-l + a^Ck-i (— + Cfc_i — Afe_i) 

< 'S'fe-i ( — Gmax — Afc_i) \ 

5. If Afc_i < -G„,ax- min{Q!dS'fc_i,-Dmax}, then Sfe = Sf'^ = Sk-i - min{S'fe_i,D^ax/ad}- 

By induction we show that the stationary pohcy tt^^' is optimal for the finite horizon stochastic 
program. Then for any n G N and policy tt, we have 

ljf)(;r(^),5,)<-jP(vr,50, 
n n 

and thus 

J(2) (7r(2) , 5i ) = lim sup - Jf ^ (7r(2' ,Si)< lim sup - jf ' (tt, 5i ) = J^^) (tt, ) . 

n— ^oo ^ n— ^oo ^ 
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Appendix B: Proofs for the residual power system results 

We first establish bounds on the expected stored power. 

Lemma 2. Suppose that S'j+i = maxjS'j + 0j, 0} for i = 1,2, . . . ,n, where Si>0 is given and Qi, 
i = \,2, . . . ,n, is a sequence of IID random variables with mean jiQ and variance cr| . 

1. //Me < 0, then E[Sfe+i] < k{jil + a^) + SI . 

2. Ifn@>0, then Si + kfie<E[Sk+i]<kf^@ + Vklog2i4.k)^^l + Sf/k. 
For /^e < 0, consider 



where the equaUty follows by the independence between and (0i,02, ■ ■ ■ , ©fe-i) and the last 
inequality follows by induction. Thus, by Jensen's inequality, 



E[Sl^,]<E[{S, + @kn 



E[Sl] + 2E[S,]E[e,]+E[Ql] 



<E[Sl] + {nl + al) 



<Sl + kifil + al), 



nS,+,] < ^/e[SI^] < ^k{i,l + al) + Sf 



For jj,T > 0, the expected stored power can be lower bounded as 



E[Sk+i] > E[Sk + Ok] >Si + kne. 



where the last inequality follows by induction. Next, we consider 



E[S'fe+i - kjjLe] = E[majc{Sk + 9fc, 0} - k/ie] 




Su and El Gamal: Limits on the Benefits of Energy Storage for Renewable Integration 



35 



where ©j = ©j — //e for i = 2, 3, . . . , A: and ©i = + ©i — /xe. Note that E[©i0j] = for z 7^ j. By 
the inequaUty in Doob (1953, Lemma 4.1), we have 

<log,iik)\kal + Sl). 







k 


2' 


E 


max 
i<j<fe 


i=j 





Therefore, 





k 








k 




max 

l<j<k 




\ \ 


E 


max 

Y<j<k 




)] 







k 


2" 


E 


max 

l<j<k 







kjiQ < Vk log2 (4fc) ^J|l% + Sl/k + kjiQ , 



where the second inequality follows by Jensen's inequality. 

Proof of Propositions 1 and 6. Since Proposition 1 is a special case of Proposition 6, we only 
need to prove the general case. For unlimited Gmax and -Smax, the optimal policy tt^^^ satisfies 

= Gi-a + A + Ai, 
Si+i = Si + a^Ci Di, 

for i = 1, 2, . . .. Note that = A+. Now we consider 

n 1 ^ 

-E^^ = -E(^«-^^-^^) 

i=l 

1 " 1 

= - V ((1 - a)Ci - A,) + -ad(5„+i - S,) 
n ^-^ n 
1=1 

" 1 
= - V ((1 - a)A+ - A,) + -ad(5„+i - S^) 

" 1 

= - V (A- - aA+) + -ad(5„+i - 



i=l 



Next let ©i = CKcAt — Ar/ctd for i = 1,2,..., which is sequence of IID random variables with 
mean jiQ = E[0j] and variance ctq = Var[©j]. Suppose that jiQ > 0. We only need to show that 
limsup„^^E[S'„+i/n] = since 



lim sup E 



" n 

-E(A.--«Ar)-ia.S, 



:E[A--Q!A+]. 
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By Lemma 2, we have 



< lim sup E 



— >->n+l 

n 



< lim sup — V^n(/i|"+o^y+"Sf = 0. 



Suppose that /Iq < 0. By Lemma 2, we have 



lim sup E 



1 " 1 

-^Gi <E [A- -aA+] +limsup-Q;d (n/Lte + V^log2(4fe)A/a|Tsf7") 

1=1 J 

= E [A" - aA+j + ad/xe = 0. 



Proof of Proposition 2. To find the expected average fast-ramping generation J''-^\7r'^^\ Si) 
under the Laplace assumption, we need to show that there exist a constant r] and a function v{s) 
such that 



ri + v{s) = E 



(9) 



and lim„^ooE [u(5„)/n] = Arapostathis et al. (1993). Then the average cost is equal to rj. Now 
we verify that for the policy tt^^^ in Theorem 1, 



v = — 

v{s) 



1 / (l-a)(l-e 



-AGn 



2A V 1 - ae-(i/«<:-"d)>'S'max/2 



1 _ Q,e-(V«c-ad)ASmax/2 y AVI 



1 fl + a 
s + - 

a 



Q,g-(l/ac-a;d)A(Smax-s)/2 
1 — Q,g-(l/a;c-Q:d)^'5max/2 



(1-e 



-AGn 



satisfy 

r] + v{s) = E 
= E 



■t-l^max); — (Smax - s) < A 



+ E 



u(s + acA); < A < — (S^^x - s) 



+ E 



sH A ;-Q;dS< A<0 

CKd 



+ E [- A - CKdS + l'(O) ; -Gmax - QldS < A < -ttdS] 



+ E[G^a. + v(0);A<-G' 

max 



where for random variable X and a set A we define E[X;^] =E[X|^]P(^). Furthermore, v{s) is 
bounded for s G [0, 5max] , and thus lim„_^oo E [w(5'„) /n] = 0. 
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Proof of Proposition 3. Since the optimal policy vr^^' in Theorem 1 is stationary, the corre- 
sponding stored power sequence is a Markov process 

'S'max if ('S'niax ^ Si) / O.^ ^ Aj 

S", +QcAj if 0< A, < (S'max-S'j)/ac 

Si + Aj/aj if — oi^\Si < Aj < 
Let Fi be the cdf of S*,. Then for < s < S'max, 



Si+i — ^ 



-a;(j(Smax-s) ^ ["^ \ 

-adC-^max-s) 

(l/ac)s \ 

+ 1 -e-^'Fi{s-aJ)d6. (10) 



/-ajl^^imax-s; \ f X 

OO J —ar\(Sinax—s) 

I 



For s < 0, Fi_|_i(s) = 0, and for s > S^iax, -fi+i('S) = 1. Let 

'O ifs<0 



1 _ i±^e-(i/«c-«d)A./2 ] if < s < 5„ 



\ Q;g-(l/Q:c-Q:d)-^'S^max/2 

1 if S > iSmax 



It can be verified that (10) is satisfied with Fi{s) = Fj+i(s) = Fs{s) for all s. Now we only need to 
show that the Markov chain is irreducible with respect to Fs{s), which implies that the stationary 
distribution is unique (Gilks et al. 1995, Theorem 4.1). Let Si = s, and let B = (6i, 62) C [s, S^ax] 
such that P{S eB} >0 under the stationary distribution, that is, 62 > ^i- Then 

/•(i/«c)(fe2-s) \ 1 

P{5,+i eB}= ^e-^'d6 = ^ (e-(V«c)A(6i-«) _ g-(l/„e)A(62-«)) > 0. 

i(l/c<c)(6i-s) 2 2 

If H = {0}, then 

_e^*d(5 = -e-«d^«>0. 

Similarly, PlSi+i G > for H = (61, 62) C [0, s] and {S'max}- Thus, we can generalize B to any set 
such that PIS' ^B}>0. Therefore, the Markov chain is irreducible. 

Using stationary distribution Fs{s) above, we can find the corresponding distribution of the 
fast-ramping generation 

/-ff-^d'S^max \ r-g \ / 1 \ 

^e^'dS- / ^e''Fs(--iS + 9))d5 
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1 _ l^--'^5-AadSn 

2 



\ Q/g-(l/ac-ad)-^'S'max/2 

1-a 



( 1 ^Q->^3 _ Ig-As-AcdSmax i ^g-Ag-(l/c«c+ad)ASmax/2 ] 

V 2 2 2 J 



= 1 1 - e~^^ 

2(1 - Q;e-(V«c-ad)ASmax/2) 

for g>0, Fcig) = for 5 < 0, and ^^(5) = 1 for 5 > G^^^. 
Proof of Proposition 4- For any Uk+i > 0, consider 

^ [l{Afc+i<-Gmax-adS'fc+i}] = E [l{Afc+i<-Gmax-adSfc+i} ; 'S'fe+l < 

+ E [l{Afc+i<-Gn,ax-adSfe+l};5'fc+l >Uk+l] 
< P{Sk+l < Uk+l} + E [l{Afc+i<-G„,ax-ad«fc+i}] 
= P{5fe+i < Uk+l} + -PA(-Gmax " Qid'Wfe+l) 

and choose Uk+i = E[Sk+i]/2. Then we have 

E [l{Afe+i<-Gmax-adSfe+i}] < P I'S'fc+l < -EfSfe+ijj +Fa ^-Gmax ^_L±til^ 

lT:^rc il , P adE[5fe+i] 



= P |E[,Sfe+i] - Sk+i > - E[Sk+i] I + Fa \^-G, 

For unhmited Smax, under the optimal pohcy tt^^' in Theorem 2, we have 

S'fe+l = max {S'fc + Bfc, 0} , 

where 0fe = ac(Gi„ax + Afe)+ - (l/ad)(Gmax + Afc)" with /i© = EfGfe] > and cr| = Var[efe]. Then 
by Lemma 2, 

5i + kfxe < E[5'fe+i] < kne + Vk\og2{4k)^al + Sf/k. 

Thus, 

E [l{A,+i<-G.a.-adS.+i}] < ? {(E[-Sfc+i] - Sk+,f > ^ ElSfe+ij^j + Fa (-G^ax - Y(/c/xe + S,)) 

4Var(,Sfe+i 



4Var(,bfe+i , p / ^ "d., , c- ^^ 

4Var(Sfc+ 
(fc/ie + 'S'i 
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where the second inequahty follows by the Chebyshev's inequality. Next we consider 



Yar[S,+,]=E[Sl^,]-mS,+,]f 

<E[{Sk + @kf]-{kfie + S^f 

= E [Si] + 2//e E[Sk] + ifil + 4) - {k^ie + S,] 



< E [Si] + 2/xe \^{k - l)/xe + Vk^logM^ - 1)) ^4 + j^j + (/^l + 4) - (^/^e + S,) 

k 

< J2 ((j - l)/^0 + Vr^logMj - 1)) + + Kf^l + 4) + S! - {kfie + S,)' 

k 

= J2 2fieVr^logMj - 1)) V4+^ + Hk - 1)4 + ^(4 + 4) + S! - {kfie + S.f 

fc 

= 2fieVr^logMj - 1)) V4+^ + ^(4 - Wi), 



3=2 



where the third inequality follows by induction. Since 

-ln4(j-l 



3=2 



In 2 



1 Ji 



31n2 



for some constant Ci > and for sufficiently large k, we have 



/c=l k=l ^ ^ ' 

where k', C2, and C3 are constants. 

Proof of Proposition 5. We first establish a lower bound on the expected loss of load probability. 
For power storage capacity 5max5 J'''^\'^''^\ Si) is lower bounded by the expected loss of load 
probability associated with the stored power sequence 5, = Smax for i = l,2,..., that is, 

1 



j(2)(7r(2),5i)>linisupE 



1 ^ 

i^l 



■ Gmax CK(i Smax } 
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Thus, 

Inj(^)(7r(^),g0 ^ 

lim > -UdX = 7min- 

Next, we establish an upper bound on J'^^) (tt, ^i) by considering a suboptimal policy tt in Table 4. 
Let Fi be the cdf of 5j. Then for < s < S^^, 

-OO ^ ^-Gmax-ad('Smax-s) ^ V Old / 

/O \ /.(l/ac)s \ 

_ ^g — AGmax-ad-^('Smax — s) _j_ C"'^'^™'"' ) i^' (s) 

-Gmax-c«d(Smax-s) ^ V CXd / Jo ^ 

It can be verified that 

Fs(s) = ^''""""^ (-1 + ^ + " e^°- 

Q/gAo5'max g — AGmax \ ]^ _|_ g — AGmax 

for < s < /Smax is a stationary distribution of the process {Si}, where 



< Ao = — — — — tA < aaA. 

By similar steps in the proof of Proposition 3, we can show that the stationary distribution is 
unique. Thus, 



/ — Gmax — CKd^max \ Gmax \ / X J_ \ 

OO ^ ^-Gmax-adSmax ^ \ / 



1 / a — e 

_g-AG„ 



^max 



gAo5'max — 



max 



Therefore, 

hm < -Ao = 

*max— >cx) iJmax 



Su and El Gamal: Limits on the Benefits of Energy Storage for Renewable Integration 



41 



Table 4 Suboptimal policy for minimizing loss of load probability. 





Gi 


Ci 


A 


Si+i 














etc 




ac 






< Ai < ~ 





Ai 





Si + acAi 


ac 










-C?max < Ai < 


-Ai 








Si 


— Gmax — OCiiSi < Aj < — Gmax 







Gmax 


^ 1 Gmax + Ai 












Ai < —Gmax — (XdSi 


^max 





OidSi 






Appendix C: Proofs for the two-threshold policy 

Proof of Proposition 7. Define the cost-to-go function of the two-slot dynamic program 



y^PlQj +P2l{Ai<-Gmax-adg. 



Sk 



for k = l,2. When k = 2, the expected loss of load probability is equal to 

E [p2l{A2<-Gmax-adS'2}| 'S'2] 

and does not depend on the control 1:2 = {G2,C2,D2). Thus, the optimal policy at time 2 is = Tr^\ 
that is, Sc2 = Sd2 = 0. Then the minimum cost-to-go function is 



J2(7r*,S'2) -E PiGa^^ +p2l{A2<-Gmax-adS2} 



-ad'S'2 



(PlGmax + P2)/A((^2) ^^2 - / Pi<^2/a(^2) d'^2- 

— Gmax— ad'52 



Note that J2(7r*,S'2) is convex and decreasing in S2 since 

5J2(7r*,52) 



dSo 



-Q;d(PlG'i„ax + P2)/A(-Gi„ax " Oid'S'2) 



/-«d'52 
adPl/A(<52) (^(^2 
-Gmax — 0!d'S'2 

= -Q!dP2/A(-Gmax " ad'S'2) - / Q!dPl/A((^2 - Q:d'S'2) c/(^2 < 0, 

*^ —Gmax 

and /a in increasing on (— oo,0]. 

Next we consider the cost-to-go function at time 1. For a policy tti, we consider the following 
three cases. 
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1. If Ai>(S^,,-5i)/ae, then 

PlGi + P2l{Ai<-Gi„ax-ad'Si} + J2 ( TT*, 5i + tt^Ci Di) > J2 (tT*, Smax) , 

where the lower bound is achieved by Gi = 0, Ci = (Smax — S'lj/ctc and Di = 0. 

2. If Ai < — Gmax — ctdSi, then loss of load occurs, and Gi = Gmax, Ci = 0, and Di = a^Si. 

3. For — Gmax — "d'S'i < Ai < (5i„ax — Si)/ac, consider the lower bounds on fast-ramping gener- 
ation 

Gi > — -S^ + —D,] -D,-A,> -Ai + — (^2 - S,), 
ftc V "d y "c 



Thus, we have 



Gi > Cl - ad(acCi + - ^s) - Ai > -Ai + a^iS^ - S,). 



PiGi -I- J2(7r*, ^a) > max{ Jic(5'i, ^z), Jid!-?!, ^z)}, 



where 

Jid(.5i, 52) = Pi (-<5i + a^iS^ - 50) + J2(7r*, -S2). 

Note that the loss of load probability p2l{Ai<-Gmax-c«d'Si} ^^^^ depend on the control. By the 
convexity of J2(7r*, 52), Jic{Si,S2) and JidC-^i, 52) are convex. Let 

= arg min MS„S2) = f sup |o < ^2 < -S^ax : ^-^'^f^^^'^ < 



0<S2<Smax \ I, Oc 

dJ2_in^S2] 

0<S2<Smax V I 5^2 



Sdi= argmin Jid(5i, ^2) = |^sup |o < ^2 < ^^ax : ^^^^^^^^r^ < -"dPi 



+ 



where sup0 = —00. Then < 5ci < < iSmax- If Ai > —Gmax, then < S2< minis'! + ac(Gmax + 
'^))'S'max}, and if Ai < —Gmax, thcu < S2 < maxj^i + (Gmax + S)/ad,0}. Therefore, the two- 
threshold policy with parameters (5ci,5di) is optimal at time 1 by the convexity of Jic and Jij. 
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